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Sara McMains, Joseph M. Helierstein, Carlo H. Sequin 

May 2001 Proceedings of the sixth ACM symposium on Solid modeling and 
applications 

Additional Information: lull citation, abstract, references , tilings. 
terms 



Full text available: ^.Dd£L22.MBj 



•:dex 



Many solid modeling applications require information not only about the geometry of an 
object but also about its topology. Most interchange formats do not provide this information, 
which the application must then derive as it builds its own topological data structure from 
unordered, "polygon soup" input. For very large data sets, the topological data structure 
itself can be bigger than core memory, so that a naive algorithm for building it that doesn't 
take virtual memory access p ... 

2 AJSurve^^ 
Robin Williams 

January 1971 ACM Computing Surveys (CSUR), volume 3 issue l 

Additional Information: Mcitatjon, abstract, references, drinks, index 



Full text available: WfififLtfiLSflBi 



terms 



This is a survey of a data structures and their use in computer graphics systems. First, the 
reasons for using data structures are given. Then the sequential, random, and list 
organizations are discussed, and it is shown how they may be used to build complex data 
structures. Representative samples of languages specifically designed for creating and 
manipulating data structures are described next Finally some typical computer graphics 
systems and their data structures are described. It is a ... 

Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies 
on Collaborative research 

Full text available: fl^iffi4J23J&lBl Additional Information: MLsMion, Sb&iJEa&L MSE2&D£gS. in&&J@I&£ 



Understanding distributed applications is a tedious and difficult task. Visualizations based on 
process-time diagrams are often used to obtain a better understanding of the execution of 
the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not provide 
the user with the desired overview of the application. In our experience, such tools display 
repeated occurrences of non-trivial commun ... 
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Marcos Andre Gongalves, Edward A. Fox, Layne T. Watson, IMeill A. Kipp 

April 2004 ACM Transactions on Information Systems (TOIS), volume 22 issue 2 

Full text available: ««ffl316.85 KB) Additional ,nformation: MflWtolL ti&OSL ffitafflfifiS. fiOogs. tadtt 
^ termSj review 

Digital libraries (DLs) are complex information systems and therefore demand formal 
foundations lest development efforts diverge and interoperability suffers. In this article, we 
propose the fundamental abstractions of Streams, Structures, Spaces, Scenarios, and 
Societies (5S), which allow us to define digital libraries rigorously and usefully. Streams are 
sequences of arbitrary items used to describe both static and dynamic (e.g., video) content. 
Structures can be viewed as labeled directed gra ... 

Keywords: applications., definitions, foundations, taxonomy 
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Hanan Samet 

June 1984 ACM Computing Surveys (CSUR), volume 16 issue 2 

Full text available: ^jxj£4 JZ.MB1 Additional Information: fyjlciiajjon, references, £ifiO&£. ind^ terrr$ 
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D. Sriram, R. Joobbani 

January 1985 ACM SIGART Bulletin, issue 91 

Full text available: 1||j?df(& ,79.MBj Additional Information: fuji.crtatipn J abstract 



The papers in this special issue were compiled from responses to the announcement in the 
July 1984 issue of the SIGART newsletter and notices posted over the ARPAnet. The interest 
being shown in this area is reflected in the sixty papers received from over six countries. 
About half the papers were received over the computer network. 

A facility for defining and manipulating generalized data structures 
Billy G. Claybrook 

December 1977 ACM Transactions on Database Systems (TODS), Volume 2 issue 4 

Additional Information: Miration, abstract, references, citings, index 



Full text available: m pdf(2.87 MB) 

^ terms 

A data structure definition facility (DSDF) is described that provides definitions for several 
primitive data types, homogeneous and heterogeneous arrays, cells, stacks, queues, trees, 
and general lists. Each nonprimitive data structure consists of two separate entities—a head 
and a body. The head contains the entry point(s) to the body of the structure; by treating 
the head like a cell, the DSDF operations are capable of creating and manipulating very 
general data structures. A tempi ... 

Keywords: data definition languages, data structure definition facility, data structures, 
database management 
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Full text available: ^ad.S.4^fiMB5. Additional Information: MLsRstfen. abstract mferences, citings, index 

i&txns. 

Software reuse is the process of creating software systems from existing software rather 
than building software systems from scratch. This simple yet powerful vision was introduced 
in 1968. Software reuse has, however, failed to become a standard software engineering 
practice. In an attempt to understand why, researchers have renewed their interest in 
software reuse and in the obstacles to implementing it. This paper surveys the different 
approaches to software reuse found in the ... 

Keywords: abstraction, cognitive distance, software reuse 



9 Ext^aLmem^ 
Jeffrey Scott Vitter 

June 2001 ACM Computing Surveys (CSUR), volume 33 issue 2 

Additional Information: MLsMteD., sbsiraet, .references, citings, index 



Full text available: TO otftf 828.46 KB: 

Data sets in large applications are often too massive to fit completely inside the computers 
internal memory. The resulting input/output communication (or I/O) between fast internal 
memory and slower external memory (such as disks) can be a major performance 
bottleneck. In this article we survey the state of the art in the design and analysis of 
external memory (or EM) algorithms and data structures, where the goal is to exploit locality 
in order to reduce the I/O costs. We consider a varie ... 

Keywords: B-tree, I/O, batched, block, disk, dynamic, extendible hashing, external 
memory, hierarchical memory, multidimensional access methods, multilevel memory, 
online, out-of-core, secondary storage, sorting 



1 0 An XMLQU§iy -S^ H 
Zachary G. Ives, A. Y. Halevy, D. S. Weld 

December 2002 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 11 Issue 4 
Full text available: ■gpdfQSI JJfiLKRi Additional Information: MLsiMon, wsiSMSL .tifiGas. .ilKlsSS-teims 

XML has become the lingua franca for data exchange and integration across administrative 
and enterprise boundaries. Nearly all data providers are adding XML import or export 
capabilities, and standard XML Schemas and DTDs are being promoted for all types of data 
sharing. The ubiquity of XML has removed one of the major obstacles to integrating data 
from widely disparate sources - namely, the heterogeneity of data formats. However, 
general-purpose integration of data across the wide are a also re ... 

Keywords: Data integration, Data streams, Query processing, Web and databases, XML 
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September 2001 Journal on Educational Resources in Computing (JERIC) 

Full text available: Additional Information: function, references . cfeL index terms 
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Arthur G. Duncan 

November 1998 ACM SIGAda Ada Letters , Proceedings of the 1998 annual ACM SIGAda 
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international conference on Ada, volume xvm issue 6 
Full text available: Sp^MQ.M&j Additional Information: Mlejtation, references, jn^ex.terms 



Keywords: Ada, functional programming, infinite data structures, lazy evaluation 
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Gordon B. Davis, John T. Gorgone, J. Daniel Couger, David L Feinstein, Herbert E. 
Longenecker 

December 1996 ACM SIGMIS Database , Guidelines for undergraduate degree programs 
on Model curriculum and guidelines for undergraduate degree 
programs in information systems, volume 28 issue i 

Full text available: ®.pdf(7,24.MBj Additional Information: .fejljsilation, cifincjs 
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.tables 

Bernard Chazelle, joe Kilian, Ronitt Rubinfeld, AyelletTal 

January 2004 Proceedings of the fifteenth annual ACM-SIAM symposium on Discrete 
algorithms 

Full text available: ^.pdfi2§.1. t S8..KBi Additional Information: fejH cl[a.tjorL abstract, H^Lg&S&M. 

We introduce the Bloomier Filter, a data structure for compactly encoding a function with 
static support in order to support approximate evaluation queries. Our construction 
generalizes the classical Bloom filter, an ingenious hashing scheme heavily used in networks 
and databases, whose main attribute— space efficiency— is achieved at the expense of a 
tiny false-positive rate. Whereas Bloom filters can handle only set membership queries, our 
Bloomier filters can deal with arbitrary func ... 

15 Pointy 

Radu Rugina, Martin C. Rinard 

January 2003 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 25 Issue 1 

Full text available: ^i>d{(3B3,2a.KBj Additional Information: MtcjMton, skstratf, M&rjS?)££3, iBrifrXj&fATLS 

This paper presents a novel interprocedural, flow-sensitive, and context-sensitive pointer 
analysis algorithm for multithreaded programs that may concurrently update shared 
pointers. The algorithm is designed to handle programs with structured parallel constructs, 
including fork-join constructs, parallel loops, and conditionally spawned threads. For each 
pointer and each program point, the algorithm computes a conservative approximation of 
the memory locations to which that pointer may point. Th ... 

Keywords: Pointer analysis 
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Periklis Andritsos, Renee J. Miller, Panayiotis Tsaparas 

June 2004 Proceedings of the 2004 ACM SIGMOD international conference on 
Management of data 

Full text available: ^.pdf(2J.7,0o KB). Additional Information: MfiltatiatL abstract, Mere.aces 

Data design has been characterized as a process of arriving at a design that maximizes the 
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information content of each piece of data (or equivalent^, one that minimizes redundancy). 
Information content (or redundancy) is measured with respect to a prescribed model for the 
data, a model that is often expressed as a set of constraints. In this work, we consider the 
problem of doing data redesign in an environment where the prescribed model is unknown 
or incomplete. Specifically, we consider the p ... 

1 7 On. understand ingiypes^d 
Luca Cardelli, Peter Wegner 

December 1985 ACM Computing Surveys (CSUR), volume 17 issue 4 

Additional Information: fyjLcjLajjof}., abstract, reference^ dtincs, Index 



Full text available: fftsdft'4.20 SylBj 

^ " tejms, review 

Our objective is to understand the notion of type in programming languages, present a 
model of typed, polymorphic programming languages that reflects recent research in type 
theory, and examine the relevance of recent research to the design of practical 
programming languages. Object-oriented languages provide both a framework and a 
motivation for exploring the interaction among the concepts of type, data abstraction, and 
polymorphism, since they extend the notion of type t ... 

18 QMaMcuctures 

Zvi Galil, Giuseppe F. Italiano 

September 1991 ACM Computing Surveys (CSUR), Volume 23 issue 3 

Full text available: f| lpdff2 3i MB) Additional Information: full citation, references, chtass. index terres . review 



Keywords: equivalence algorithm, partition, set union, time complexity 



1 9 Special ! 
M. A. Bramer 

April 1972 ACM SIGART Bulletin, issue so 

Full text available: ^.pdfi9J3.MBj Additional Information: Mloltation, abstract 

This collection of articles has been brought together to provide SIGART members with an 
overview of Artificial Intelligence approaches to constructing game-playing programs. Papers 
on both theory and practice are included. 

Luciano Baresi, Mauro Pezze 

January 2005 ACM Transactions on Software Engineering and Methodology (TOSEM), 

Volume 14 Issue 1 

Full text available: ^X5$ifCS3.4.,8S.K.f:33 Additional Information: MUMfea aiasScast mf^Does, io<^ terms 

The article proposes an approach for defining extensible and flexible formal interpreters for 
diagram notations with significant dynamic semantics. More precisely, it addresses semi- 
formal diagram notations that have precisely-defined syntax, but informally defined 
(dynamic) semantics. These notations are often flexible to fit the different needs and 
expectations of users. Flexibility comes from the incompleteness or informality of the 
original definition and results in different interpretations ... 

Keywords: Semi-formal notations, graph transformation, high-level Petri nets, semantics 
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1 External memory algorithms and data structures: dealing with massive data 
Jeffrey Scott Vitter 

June 2001 ACM Computing Surveys (CSUR), volume 33 issue 2 



Full text available: f| oM£2.BJ6 KB). 



Additional Information: Ml cjtatiQ.Q, abstract, references, clijngs, Index 
[grim 



Data sets in large applications are often too massive to fit completely inside the computers 
internal memory. The resulting input/output communication (or I/O) between fast internal 
memory and slower external memory (such as disks) can be a major performance 
bottleneck. In this article we survey the state of the art in the design and analysis of 
external memory (or EM) algorithms and data structures, where the goal is to exploit locality 
in order to reduce the I/O costs. We consider a varie ... 

Keywords: B-tree, I/O, batched, block, disk, dynamic, extendible hashing, external 
memory, hierarchical memory, multidimensional access methods, multilevel memory, 
online, out-of-core, secondary storage, sorting 
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February 1990 ACM SIGPLAN Notices , Proceedings of the second ACM SIGPLAN 

symposium on Principles & practice of parallel programming, volume 25 

Issue 3 

Additional Information: feij. citation, abstract, references, citings, Index 



Full text available: mscifCl MMB}. 



terms 



Programming nonshared memory systems is more difficult than programming shared 
memory systems, since there is no support for shared data structures. Current programming 
languages for distributed memory architectures force the user to decompose all data 
structures into separate pieces, with each piece "owned" by one of the processors in the 
machine, and with all communication explicitly specified by low-level message-passing 
primitives. This paper presents a new programming envir ... 



3 A Survey of Data Stmctures for Cornpiu 
Robin Williams 

January 1971 ACM Computing Surveys (CSUR), Volume 3 issue l 

Full text available: ffifidtU Additional Information: MlcMion, abstract, references, citjngs, index 
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terms 

This is a survey of a data structures and their use in computer graphics systems. First, the 
reasons for using data structures are given. Then the sequential, random, and list 
organizations are discussed, and it is shown how they may be used to build complex data 
structures. Representative samples of languages specifically designed for creating and 
manipulating data structures are described next. Finally some typical computer graphics 
systems and their data structures are described. It is a ... 

James R. Low 

May 1978 Communications of the ACM, volume 21 issue 5 

Full text available: f jjxifM 14 MB) Additional Information: Miration, abstract, reMences, citing index 

The use of several levels of abstraction has proved to be very helpful in constructing and 
maintaining programs. When programs are designed with abstract data types such as sets 
and lists, programmer time can be saved by automating the process of filling in low-level 
implementation details. In the past, programming systems have provided only a single 
general purpose implementation for an abstract type. Thus the programs produced using 
abstract types were often inefficient in space or time. ... 

Keywords: abstract data types, automatic programming, data structures, lists, optimizing 
compilers, sets 
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Moni Naor, Vanessa Teague 

July 2001 Proceedings of the thirty-third annual ACM symposium on Theory of 
computing 

Additional Information: full citation , abstract, references . cSings . Index 



Full text available: Mpdif 308. 07 KBj 

v tsrrns 

Many data structures give away much more information than they were intended to. 
Whenever privacy is important, we need to be concerned that it might be possible to infer 
information from the memory representation of a data structure that is not available 
through its "legitimate" interface. Word processors that quietly maintain old versions of a 
document are merely the most egregious example of a general problem. We deal with data 
structures whose current memory repres ... 

Keywords: algorithms, anti-persistence, data structures, hash table, history independence, 
privacy, security 
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conference on Object-oriented programing, systems, languages, and 

applications, Volume 38 Issue 11 

Full text available: M Pdff340.58 KB; Additional ,nformation: MfiWtaL abstract, references, citings, index 



&rms 

We present a system that accepts a specification of key data structure consistency 
constraints, then dynamically detects and repairs violations of these constraints, enabling 
the program to continue to execute productively even in the face of otherwise crippling 
errors. Our experience using our system indicates that the specifications are relatively easy 
to develop once one understands the data structures. Furthermore, for our set of benchmark 
applications, our system can effectively repair inco ... 

Keywords: data structure invariants, data structure repair 



Out-of-core build of a topological data structure from polygon soup 
Sara McMains, Joseph M. Hellerstein, Carlo H. S6quin 

May 2001 Proceedings of the sixth ACM symposium on Solid modeling and 
applications 

Full text available- Mod* i 22 h\B\ Additional Information: Ml citation , ji^cL references , oitinaiL lode* 
k**"-** " terms 

Many solid modeling applications require information not only about the geometry of an 
object but also about its topology. Most interchange formats do not provide this information, 
which the application must then derive as it builds its own topological data structure from 
unordered, "polygon soup" input. For very large data sets, the topological data structure 
itself can be bigger than core memory, so that a naive algorithm for building it that doesn't 
take virtual memory access p ... 

-Supporting dynamic data structures on riistributed-memory machines 
Anne Rogers, Martin C. Carlisle, John H. Reppy, Laurie J. Hendren 

March 1995 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 17 Issue 2 

Additional Information: full citation, abstract, references, citings, Index 



Full text available: mpdtf 2.05 MB). 

v terms, review 

Compiling for distributed-memory machines has been a very active research area in recent 
years. Much of this work has concentrated on programs that use arrays as their primary 
data structures. To date, little work has been done to address the problem of supporting 
programs that use pointer-based dynamic data structures. The techniques developed for 
supporting SPMD execution of array-based programs rely on the fact that arrays are 
statically defined and directly addressable. Recursive data s ... 

Keywords: dynamic data structures 



1 0 .DM§. Mfuoiure. architectures 
W. K. Giloi, H. K. Berg 

April 1978 Proceedings of the 5th annual symposium on Computer architecture 

Full text available: *scSfiSSS.8a KB) Additjonal Information: M-titailfiEL s&£j£l& references sjfiogs. jndex 
™ * "* * terms 

Computer architectures may be characterized by their operational principle and their 
physical structure. The paper defines these two characteristics for the novel concept of data 
structure architectures (DSAs). The representation and processing of arbitrary data 
structures in such a DSA is demonstrated by examples. It is shown how the functional 
requirements of a DSA can be satisfied by the specific information structure and the physical 
structure of the STARLET architecture introduced in p ... 
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June 2004 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 2004 conference 
on Programming language design and implementation, volume 39 issue 6 

Full text available- « od«JW 16 KB; Additional lnformation: Mixtion, abstract, reference^ citings, index 
^ terms 

While the memory of most machines is organized as a hierarchy, program data are laid out 
in a uniform address space. This paper defines a model of reference affinity, which measures 
how close a group of data are accessed together in a reference trace. It proves that the 
model gives a hierarchical partition of program data. At the top is the set of all data with the 
weakest affinity. At the bottom is each data element with the strongest affinity. Based on 
the theoretical model, the paper p ... 

Keywords: array regrouping, program locality, program transformation, reference affinity, 
reuse signature, structure splitting, volume distance 
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February 1989 Proceedings of the twenty-first annual ACM symposium on Theory of 
computing 

Additional Information: felicitation, abstract, rejferences, citings, Index 



Full text available: ffipdffl 23 MB} 

)s ® jr terms 

Dynamic data structure problems involve the representation of data in memory in such a 
way as to permit certain types of modifications of the data (updates) and certain types of 
questions about the data (queries). This paradigm encompasses many fundamental 
problems in computer science. The purpose of this paper is to prove new lower and upper 
bounds on the time per operation to implement solutions to some familiar dynamic data 
structure problems including list representation, sub ... 

1 3 Software .transactional 

Maurice Herlihy, Victor Luchangco, Mark Moir, William N. Scherer 

July 2003 Proceedings of the twenty-second annual symposium on Principles of 
distributed computing 

Additional Information: MlsMion, abstract references, citings, index 



Full text available: m pdf( V> 1 MBJ 

terms 

We propose a new form of software transactional memory (STM) designed to support 
dynamic-sized data structures, and we describe a novel non-blocking implementation. The 
non-blocking property we consider is obstruction-freedom. Obstruction-freedom is weaker 
than lock-freedom; as a result, it admits substantially simpler and more efficient 
implementations. A novel feature of our obstruction-free STM implementation is its use of 
modular contention managers to ensure progress in practice. We ... 

14 A .loglsM. approach, to.^ 
Russell Turpin 

December 1993 ACM SIGSOFT Software Engineering Notes , Proceedings of the 1st ACM 
SIGSOFT symposium on Foundations of software engineering, Volume is 

Issue 5 

Full text available: ^Dd£l : G7.MB). Additional Information: MLeMion., attract, reference^, M^.terr^s 

The Galois project at the University of Texas is building a programming environment that 
supports the formal development and verification of data structure programs. This 
programming environment supports features such as pointer manipulation and destructive 
update that often make formal treatment difficult. 
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January 1976 Proceedings of the 3rd ACM SIGACT-SIGPLAN symposium on Principles 
on programming languages 

Full text available- -B sdfi 780.37 KB} Additional ,nformatlon: M cMfcn, abstract, reference*, citings, index 

We are all aware of the development of increasingly sophisticated, elaborate, and expensive 
computer programs, particularly in the fields of artificial intelligence, data base 
management, and intelligent systems. The need for techniques to deal with such complexity 
has renewed interest in programming language research. Recent work on structured 
programming, intelligent compilers, automatic program generation and verification, and 
high-level optimization has resulted. A pattern of ... 

1 6 Pointer analysis fp^ 

Radu Rugina, Martin C. Rinard 

January 2003 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 25 Issue 1 

Full text available: ^^df(3S3.29.KB} Additional Information: MlSMion, abstract references, index.te.n75s 

This paper presents a novel interprocedural, flow-sensitive, and context-sensitive pointer 
analysis algorithm for multithreaded programs that may concurrently update shared 
pointers. The algorithm is designed to handle programs with structured parallel constructs, 
including fork-join constructs, parallel loops, and conditionally spawned threads. For each 
pointer and each program point, the algorithm computes a conservative approximation of 
the memory locations to which that pointer may point. Th ... 

Keywords: Pointer analysis 
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Joseph Hummel, Laurie J. Hendren, Alexandru Nicolau 

June 1994 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 1994 conference 
on Programming language design and implementation, volume 29 issue 6 

Additional Information: fyij. ciiaijofi, ab^ract, references chinos, index 



Full text available: mpdfi'l. 27 MB) 

^ terms 

Optimizing compilers require accurate dependence testing to enable numerous, 
performance-enhancing transformations. However, data dependence testing is a difficult 
problem, particularly in the presence of pointers. Though existing approaches work well for 
pointers to named memory locations (i.e. other variables), they are overly conservative in 
the case of pointers to unnamed memory locations. The latter occurs in the context of 
dynamic, pointer-based data structures, used in a variety of a ... 

1 8 The . Qijadtree .and Belated. Hjerarc^ 
Hanan Samet 

June 1984 ACM Computing Surveys (CSUR), volume 16 issue 2 

Full text available: ^i>df(4 ; B7.MBl Additional Information: Ml.cjtatJon, references, cltinas, index terrns 
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Amir Roth, Andreas Moshovos, Gurindar S. Sohi 

October 1998 Proceedings of the eighth international conference on Architectural 
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terms 

We introduce a dynamic scheme that captures the accesspat-terns of linked data structures 
and can be used to predict future accesses with high accuracy. Our technique exploits the 
dependence relationships that exist between loads that produce addresses and loads that 
consume these addresses. By identzj+ing producer-consumer pairs, we construct a compact 
internal representation for the associated structure and its traversal. To achieve a 
prefetching eflect, a small prefetch engine speculatively t ... 

20 Burst tries: a fast, efficient data structure for string keys 

April 2002 ACM Transactions on Information Systems (TOIS), volume 20 issue 2 

Full text available: fftodir324.84 KB) Additional ,nformatjon: MdWto abSSffi*. references, tftags, index 
^ % terms, review 

Many applications depend on efficient management of large sets of distinct strings in 
memory. For example, during index construction for text databases a record is held for each 
distinct word in the text, containing the word itself and information such as counters. We 
propose a new data structure, the burst trie, that has significant advantages over existing 
options for such applications: it uses about the same memory as a binary search tree; it is 
as fast as a trie; and, while not as fast as a ... 

Keywords: Binary trees, splay trees, string data structures, text databases, tries, 
vocabulary accumulation 
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1 S^slon.4A;..RMrQactiy 

Erik D. Demaine, John Iacono, Stefan Langerman 

January 2004 Proceedings of the fifteenth annual ACM-SIAM symposium on Discrete 
algorithms 

Full text available: ^ gaftiS3.43 KB) Additional Information: citation, abstract r references 

We introduce a new data structuring paradigm in which operations can be performed on a 
data structure not only in the present but also in the past. In this new paradigm, called 
retroactive data structures, the historical sequence of operations performed on the data 
structure is not fixed. The data structure allows arbitrary insertion and deletion of operations 
at arbitrary times, subject only to consistency requirements. We initiate the study of 
retroactive data structures by formally def ... 

XML.dats 

Zografoula Vagena, Mirella M. Moro, Vassilis J. Tsotras 

June 2004 Proceedings of the 7th International Workshop on the Web and Databases: 
colocated with ACM SIGMOD/PODS 2004 

Full text available: ^p:dJ(522J.0.KB.i Additional Information: MLcMjon, rejererxces, .index terrns 

XML and semi-structured data is usually modeled using graph structures. Structural 
summaries, which have been proposed to speedup XML query processing have graph forms 
as well. The existent approaches for evaluating queries over tree structured data (i.e. data 
whose underlying structure is a tree) are not directly applicable when the data is modeled as 
a random graph. Moreover, they cannot be applied when structural summaries are 
employed and, to the best of our knowledge, no analo ... 

3 ioigrneiappn^ j 
James Aspnes, Jonathan Kirsch, Arvind Krishnamurthy 

July 2004 Proceedings of the twenty-third annual ACM symposium on Principles of 
distributed computing 

Full text available: ^pgf(23MP..KBj Additional Information: Ml citation, abstract, references, index.terx?)s 

We describe a load-balancing mechanism for assigning elements to servers in a distributed 
data structure that supports range queries. The mechanism ensures both load-balancing 
with respect to an arbitrary load measure specified by the user and geographical locality, 
assigning elements with similar keys to the same server. Though our mechanism is 
specifically designed to improve the performance of skip graphs, it can be adapted to 
provide deterministic, locality-preserving load-balancing to any d ... 
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Anthony J. Winkler, A if red G. Dale 

April 1971 Proceedings of the 1971 international ACM SIGIR conference on 
Information storage and retrieval 

Full text available: ^ »xif» 660.*S KB) Additional Information: full cKation, abstract, references 

An approach to determining an appropriate file structure for a given application is presented, 
by outlining a methodology for comparing some important aspects of data management 
system performance. The aspect chosen for analysis is the processing time required to 
evaluate Boolean functions defined on data values contained within a file structure and 
select elements from the structure satisfying the expression.Two file structures are studied. 
The structures are each combinations of hierarchical a ... 

Keywords: Boolean function evaluation, data management system, data structure, file 
structure, file structure determination, hierarchical file, hierarchy, inverted file, logical entry, 
modeling, processing algorithm, retrieval, simulation, storage structure, update 
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Sara Comai, Ernesto Damiani, Piero Fraternali 

October 2001 ACM Transactions on Information Systems (TOIS), volume 19 issue 4 

i ui iffiPi OA i/n, Additional Information: full citation, abstract, references, cltinas. Index 
Full text available: t£ 1 go h / 0 / . oQ KB } ~* 

The rapid evolution of XML from a mere data exchange format to a universal syntax for 
encoding domain-specific information raises the need for new query languages specifically 
conceived to address the characteristics of XML. Such languages should be able not only to 
extract information from XML documents, but also to apply powerful transformation and 
restructuring operators, based on a well-defined semantics. Moreover, XML queries should 
be natural to write and understand, as nontechnical person ... 

Keywords: Document restructuring, graphical query languages, semantics 
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V. Christophides, A. Rizk 

September 1994 Proceedings of the 1994 ACM European conference on Hypermedia 
technology 

Full text available- fl& d<HT1 32 MB) Additional ,nformation: ?5;l{ citation » references , effing, iMex 

* terms 

Hierarchical logical structure and hypertext links are complementary and can be combined 
to build more powerful document management systems. Previous work exploits this 
complementarity for building better document processors, browsers and editing tools, but 
not for building sophisticated querying mechanisms. Querying in hypertext has been a 
requirement since [19] and has already been elaborated in many hypertext systems, but 
has not yet been used for hypertext systems superimposed on an und ... 

Keywords: hypertexts, information retrieval, object oriented databases, path expressions, 
query languages, structured documents 
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Zachary G. Ives, A. Y. Halevy, D. S. Weld 

December 2002 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 11 Issue 4 
Full text available: ^ grff(S5l.86 KB) Additional Information: M citation, abstract, citings, index terms 

XML has become the lingua franca for data exchange and integration across administrative 
and enterprise boundaries. Nearly all data providers are adding XML import or export 
capabilities, and standard XML Schemas and DTDs are being promoted for all types of data 
sharing. The ubiquity of XML has removed one of the major obstacles to integrating data 
from widely disparate sources - namely, the heterogeneity of data formats. However, 
general-purpose integration of data across the wide are a also re ... 

Keywords: Data integration, Data streams, Query processing, Web and databases, XML 



XML data modeling and storage: XVerter. querying XML data with OR-DBMS 
Humberto Vieira, Gabriela Ruberg, Marta Mattoso 

November 2003 Proceedings of the 5th ACM international workshop on Web 
information and data management 

Full text available: ^pdf(2rLM.!<Bj Additional Information: MLcitsfion, abstract, references, index .term?. 

Storage techniques and queries over XML databases are being widely studied. Most works 
store XML documents in traditional DBMSs in order to take advantage of a well established 
technology and also to store both structured data and XML data within a single system. This 
work proposes a translation mechanism to execute queries expressed on XQuery on top of 
XML documents that are stored in an object DBMS using the DOM implementation in disk. 
Rules for automatic translation from XQuery to SQL3 are pr ... 

Keywords: DOM, SQL3, XML, XQuery, XSLT, object DBMS 



9 XMLguery.proc^ 

Mong Li Lee, Boon Chin Chua, Wynne Hsu, Kian-Lee Tan 

November 2002 Proceedings of the eleventh international conference on Information 
and knowledge management 

Full text available: fl.:xig42LS§. KB) Additional Information: faLdlatlfiQ, Merences, index terms 

Traditionally, XML documents are processed at where they are stored. This allows the query 
processor to exploit pre-computed data structures (e.g., index) to retrieve the desired data 
efficiently. However, this mode of processing is not suitable for many applications where the 
documents are frequently updated. In such situations, efficient evaluation of multiple 
queries over streaming XML documents becomes important. This paper introduces a new 
operator, mqX-scan, which efficiently evaluates mul ... 

Keywords: finite state machines, regular path expressions, streaming XML 
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Shurug Al-Khalifa, Cong Yu, H. V. Jagadish 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
Management of data 

Full text available: -R »dt?242 55 KB\ Addjtiona! ,nformatjon: m oStation » references , mim, mm 

^ * * terms 

XML databases often contain documents comprising structured text. Therefore, it is 
important to integrate "information retrieval style" query evaluation, which is well-suited for 
natural language text, with standard "database style" query evaluation, which handles 
structured queries efficiently. Relevance scoring is central to information retrieval. In the 
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case of XML, this operation becomes more complex because the data required for scoring 
could reside not directly in an element itself but als ... 

11 Proximal nodes: a model to query document databases by content and structure 
Gonzalo Navarro, Ricardo Baeza-Yates 

October 1997 ACM Transactions on Information Systems (TOIS), volume is issue 4 

Full text available: 1fbdff$S0.43 KB \ Addltional Information: Ml citation, abstract, references, citings, Index 
. |^<>,. ,.,.>,..,..>. w terms , re view 

A model to query document databases by both their content and structure is presented. The 
goal is to obtain a query language that is expressive in practice while being efficiently 
implementable, features not present at the same time in previous work. The key ideas of 
the model are a set-oriented query language based on operations on nearby structure 
elements of one or more hierarchies, together with content and structural indexing and 
bottom-up evaluation. The model is evaluated in regard t ... 

Keywords: expressivity and efficiency of query languages, hierarchical documents, 
structured text, text algebras 
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S. Comai 

March 2001 Proceedings of the 2001 ACM symposium on Applied computing 
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Haixun Wang, Sanghyun Park, Wei Fan, Philip S. Yu 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
Management of data 

Full text available: f|pd*244.47 KB) Additional ,nformation: MafcUSQ. titaSBSL references, citing index 
* ' terms 

With the growing importance of XML in data exchange, much research has been done in 
providing flexible query facilities to extract data from structured XML documents. In this 
paper, we propose ViST, a novel index structure for searching XML documents. By 
representing both XML documents and XML queries in structure-encoded sequences, we 
show that querying XML data is equivalent to finding subsequence matches. Unlike index 
methods that disassemble a query into multiple sub-queries, and then joi ... 

14 Ad Hoc Quer^ access capability 
J. Wolfe 

July 1994 Proceedings of the eleventh annual Washington Ada symposium & summer 
ACM SIGAda meeting on Ada 

Full text available: t fl ) MB) Additional Information: full citation, rafcrenctts , index terms 
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June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
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We present the design and implementation of the XSQ system for querying streaming XML 
data using XPath 1.0. Using a clean design based on a hierarchical arrangement of 
pushdown transducers augmented with buffers, XSQ supports features such as multiple 
predicates, closures, and aggregation. XSQ not only provides high throughput, but is also 
memory efficient: It buffers only data that must be buffered by any streaming XPath 
processor. We also present an empirical study of the performance character ... 



16 An.ov^ 

Ruxandra Domenig, Klaus R. Dittrich 

September 1999 ACM SIGMOD Record, volume 28 issue 3 

Full text available: f§£0MLM KBj Additional Information: Motion, afestod, rifinas, index Mens 

Multimedia technology, global information infrastructures and other developments allow 
users to access more and more information sources of various types. However, the 
"technical" availability alone (by means of networks, WWW, mail systems, databases, etc.) 
is not sufficient for making meaningful and advanced use of all information available on-line. 
Therefore, the problem of effectively and efficiently accessing and querying heterogeneous 
and distributed data sources is an impo ... 

D. E. Wiliard, Y. C. Wee 

January 1988 Proceedings of the fourth annual symposium on Computational geometry 

Full text available- «^864 9.'x KB" Additional lnformation: MxafattaD, abstafit. references, cjtincs, index 

We define a new formalism called the quasi-valid range aggregation. This formalism leads to 
a new and quite simple method for reducing non-range query-like problems to range queries 
and often to orthogonal range queries, with immediate applications to the ATTRACTED 
NEIGHBOR and the planar ALL-PAIRS NEAREST NEIGHBORS problems (the latter being 
solved optimally on both parallel and sequential machines). A point of special interest is that 
our new formalism permits operators th ... 

18 XML and,semistructured data quer yi ng: M easuring similarity between coilectlon of 
values 

Carina F. Dorneles, Carlos A. Heuser, Andrei E. N. Lima, Altigran Soares da Silva, Edleno Silva 
de Moura 

November 2004 Proceedings of the 6th annual ACM international workshop on Web 
information and data management 

Full text available: ^pdi(2545B.KB ■ Additional Information: MLsiMiQJl ?!bstrsl.vi references, iodex.lSiXJSS 

In this paper, we propose a set of similarity metrics for manipulating collections of values 
occuring in XML documents. 



Following the data model presented in TAX algebra, we treat an XML element as a labeled 
ordered rooted tree. Consider that XML nodes can be either atomic, i.e, they may contain 
single values such as short character strings, date, etc, or complex, i.e., nested structures 
that contain other nodes, we propose two types of similarity metrics: MAVs, for atomic 
nodes and MC ... 



Keywords: XML, imprecise queries, similarity functions, vague queries 
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Full text available: fBod«i.44MB) Additional Information: Miction, abstract, references, citings, Index 

Effective support for XML query languages is becoming increasingly important with the 
emergence of new applications that access large volumes of XML data. All existing proposals 
for querying XML (e.g., XQuery) rely on a pattern-specification language that allows path 
navigation and branching through the XML data graph in order to reach the desired data 
elements. Optimizing such queries depends crucially on the existence of concise synopsis 
structures that enable accurate compile-time sele ... 
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1 External memory algorithms and data structures: dealing with massive data 
Jeffrey Scott Vitter 

June 2001 ACM Computing Surveys (CSUR), volume 33 issue 2 

Full text available- H odtfes <6 KB- Additional ^formation: full. citation, abstract, references, citings, Index 

" terms 

Data sets in large applications are often too massive to fit completely inside the computers 
internal memory. The resulting input/output communication (or I/O) between fast internal 
memory and slower external memory (such as disks) can be a major performance 
bottleneck. In this article we survey the state of the art in the design and analysis of 
external memory (or EM) algorithms and data structures, where the goal is to exploit locality 
in order to reduce the I/O costs. We consider a varie ... 

Keywords: B-tree, I/O, batched, block, disk, dynamic, extendible hashing, external 
memory, hierarchical memory, multidimensional access methods, multilevel memory, 
online, out-of-core, secondary storage, sorting 

2 The string B~tree: a new d^a structure fo 
appiicatiQ.QS 

Paolo Ferragina, Roberto Grossi 

March 1999 Journal of the ACM (JACM), volume 46 issue 2 

Full text available: f B g (M383.37 KB- Additional Information: Miration, attract, .references, citings, index 

We introduce a new text-indexing data structure, the String B-Tree, that can be seen as a 
link between some traditional external-memory and string-matching data structures. In a 
short phrase, it is a combination of B-trees and Patricia tries for internal-node indices that is 
made more effective by adding extra pointers to speed up search and update operations. 
Consequently, the String B-Tree overcomes the theoretical limitations of inverted files, B- 
trees, prefix B-trees, s ... 

Keywords: B-tree, Patricia trie, external-memory data structure, prefix and range search, 
string searching and sorting, suffix array, suffix tree, text index 
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Sairam Subramanian, Sridhar Ramaswamy 

January 1995 Proceedings of the sixth annual ACM-SIAM symposium on Discrete 
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Anne Rogers, Martin C. Carlisle, John H. Reppy, Laurie J. Hendren 

March 1995 ACM Transactions on Programming Languages and Systems (TOPLAS), 

Volume 17 Issue 2 

Full text available- I Bjefliaffi MB) Additional lnformation: M.cjiation, abstract, references, citings, Index 

Compiling for distributed-memory machines has been a very active research area in recent 
years. Much of this work has concentrated on programs that use arrays as their primary 
data structures. To date, little work has been done to address the problem of supporting 
programs that use pointer-based dynamic data structures. The techniques developed for 
supporting SPMD execution of array-based programs rely on the fact that arrays are 
statically defined and directly addressable. Recursive data s ... 

Keywords: dynamic data structures 
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Joseph Hummel, Laurie J. Hendren, Alexandru Nicolau 

June 1994 ACM SIGPLAN Notices , Proceedings of the ACM SIGPLAN 1994 conference 
on Programming language design and implementation, volume 29 issue 6 

Additional Information: foil citation, abstract, references , onirics . :f':dsx 



Full text available: m;xjK12/.MBl 

^ terms 

Optimizing compilers require accurate dependence testing to enable numerous, 
performance-enhancing transformations. However, data dependence testing is a difficult 
problem, particularly in the presence of pointers. Though existing approaches work well for 
pointers to named memory locations (i.e. other variables), they are overly conservative in 
the case of pointers to unnamed memory locations. The latter occurs in the context of 
dynamic, pointer-based data structures, used in a variety of a ... 

6 Asynchronous shared memory search structures 
Micah Adler 

June 1996 Proceedings of the eighth annual ACM symposium on Parallel algorithms 
and architectures 

Full text available: ^£ci£13£.MB). Additional Information: MlcMlon, references, dtln.gs, jndexjerrris 
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Many applications depend on efficient management of large sets of distinct strings in 
memory. For example, during index construction for text databases a record is held for each 
distinct word in the text, containing the word itself and information such as counters. We 
propose a new data structure, the burst trie, that has significant advantages over existing 
options for such applications: it uses about the same memory as a binary search tree; it is 
as fast as a trie; and, while not as fast as a ... 
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June 2003 Proceedings of the nineteenth annual symposium on Computational 
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We develop cache-oblivious data structures for orthogonal range searching, the problem of 
finding all T points in a set of N points in IR d lying in a query hyper-rectangle. Cache- 
oblivious data structures are designed to be efficient in arbitrary memory hierarchies. We 
describe a dynamic linear-size data structure that answers d-dimensional queries in 0((N/B) 
I'W+T/B) memory transfers, where B is the block size of any two levels of ... 

Keywords: cache-oblivious, orthogonal range searching 
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The performance of main-memory index structures is increasingly determined by the 
number of CPU cache misses incurred when traversing the index. When keys are stored 
indirectly, as is standard in main-memory databases, the cost of key retrieval in terms of 
cache misses can dominate the cost of an index traversal. Yet it is inefficient in both time 
and space to store even moderate sized keys directly in index nodes. In this paper, we 
investigate the performance of tree structures suitable for ... 

Keywords: B-trees, T-tree, cache coherence, key compression, main-memory indices 
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May 1999 ACM SIGARCH Computer Architecture News , Proceedings of the 26th 

annual international symposium on Computer architecture, volume 27 issue 2 

Full text available: g.pdf(ii3 l 33 KB'i Additional Information: Ml cjMton, abstract, references, citings, Index 

Current techniques for prefetching linked data structures (LDS) exploit the work available in 
one loop iteration or recursive call to overlap pointer chasing latency. Jump pointers, which 
provide direct access to non-adjacent nodes, can be used for prefetching when loop and 
recursive procedure bodies are small and do not have sufficient work to overlap a long 
latency. This paper describes a framework for jump-pointer prefetching (JPP) that supports 
four prefetching idioms: queue, full, chain, an ... 
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This paper presents an analysis technique and a novel program transformation that can 
enable powerful optimizations for entire linked data structures. The fully automatic 
transformation converts ordinary programs to use pool (aka region) allocation for heap- 
based data structures. The transformation relies on an efficient link-time interprocedural 
analysis to identify disjoint data structures in the program, to check whether these data 
structures are accessed in a type-safe manner, and to constru ... 
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May 2001 Proceedings of the sixth ACM symposium on Solid modeling and 
applications 
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Many solid modeling applications require information not only about the geometry of an 
object but also about its topology. Most interchange formats do not provide this information, 
which the application must then derive as it builds its own topological data structure from 
unordered, "polygon soup" input. For very large data sets, the topological data structure 
itself can be bigger than core memory, so that a naive algorithm for building it that doesn't 
take virtual memory access p ... 
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This paper presents an overview of the Cedar programming environment, focusing on its 
overall structure— that is, the major components of Cedar and the way they are organized. 
Cedar supports the development of programs written in a single programming language, 
also called Cedar. Its primary purpose is to increase the productivity of programmers whose 
activities include experimental programming and the development of prototype software 
systems for a high-performance personal computer. T ... 
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Maurice Herlihy, Victor Luchangco, Mark Moir, William N. Scherer 

July 2003 Proceedings of the twenty-second annual symposium on Principles of 
distributed computing 
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We propose a new form of software transactional memory (STM) designed to support 
dynamic-sized data structures, and we describe a novel non-blocking implementation. The 
non-blocking property we consider is obstruction-freedom. Obstruction-freedom is weaker 
than lock-freedom; as a result, it admits substantially simpler and more efficient 
implementations. A novel feature of our obstruction-free STM implementation is its use of 
modular contention managers to ensure progress in practice. We ... 
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Pointer-chasing applications tend to traverse composite data structures consisting of 
multiple independent pointer chains. While the traversal of any single pointer chain leads to 
the serialization of memory operations, the traversal of independent pointer chains provides 
a source of memory parallelism. This article investigates exploiting such interchain memory 
parallelism for the purpose of memory latency tolerance, using a technique called multi- 
chain prefetching. Previous work ... 

Keywords: Data prefetching, memory parallelism, pointer-chasing code 
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Recent works demonstrate several benefits of synthesizing software binaries onto FPGA 
hardware, including incorporating hardware design into established software tool flows with 
minimal impact, porting existing binaries to FPGAs, and even dynamically synthesizing 
software kernels to faster FPGA coprocessors. Those works showed that standard binary 
decompilation methods can recover enough high-level control information to result in 
reasonably-efficient hardware. However, recent synthesis methods ... 

Keywords: FPGA, binaries, decompilation, embedded systems, smart buffers, synthesis 
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Volume 25 Issue 1 
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This paper presents a novel interprocedural, flow-sensitive, and context-sensitive pointer 
analysis algorithm for multithreaded programs that may concurrently update shared 
pointers. The algorithm is designed to handle programs with structured parallel constructs, 
including fork-join constructs, parallel loops, and conditionally spawned threads. For each 
pointer and each program point, the algorithm computes a conservative approximation of 
the memory locations to which that pointer may point. Th ... 

Keywords: Pointer analysis 
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Software-controlled data prefetching offers the potential for bridging the ever-increasing 
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speed gap between the memory subsystem and today's high-performance processors. While 
prefetching has enjoyed considerable success in array-based numeric codes, its potential in 
pointer-based applications has remained largely unexplored. This paper investigates 
compiler-based prefetching for pointer-based applications— in particular, those containing 
recursive data structures. We identify the fundamental ... 
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This paper presents an overview of the Cedar programming environment, focusing primarily 
on its overall structure: the major components of Cedar and the way they are organized. 
Cedar supports the development of programs written in a single programming language, 
also called Cedar. We will emphasize the extent to which the Cedar language, with runtime 
support, has influenced the organization, comprehensibility, and stability of Cedar. Produced 
in the Computer Science Laboratory (CS ... 
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structured data 

Michael Barg, Raymond K. Wong 

October 2001 Proceedings of the tenth international conference on Information and 
knowledge management 

Additional Information: fojl.cjtatipn, abstract, references, citincs, index 



Full text available: mBdO:S2MBl 
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The richness of the XML data format allows data to be structured in a way which precisely 
captures the semantics required by the author. It is the structure of the data, however, 
which forms the basis of all XML query languages. Without at least some notion of the 
structure, a user cannot meaningfully query the data. This problem is compounded when 
one considers that heterogeneous data adhering to different schema are likely to exist in the 
database(s) being queried. This paper proposes a soluti ... 

2 Proxi^ 

Gonzalo Navarro, Ricardo Baeza-Yates 

October 1997 ACM Transactions on Information Systems (TOIS), volume is issue 4 

Additional Information: Mciiatjon, ^st[*ct, reM^nces, cltinas, Index 



Full text available: mpdf(550.A3.KB} 



terms, review 



A model to query document databases by both their content and structure is presented. The 
goal is to obtain a query language that is expressive in practice while being efficiently 
implementable, features not present at the same time in previous work. The key ideas of 
the model are a set-oriented query language based on operations on nearby structure 
elements of one or more hierarchies, together with content and structural indexing and 
bottom-up evaluation. The model is evaluated in regard t ... 

Keywords: expressivity and efficiency of query languages, hierarchical documents, 
structured text, text algebras 
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XML has become the lingua franca for data exchange and integration across administrative 
and enterprise boundaries. Nearly all data providers are adding XML import or export 
capabilities, and standard XML Schemas and DTDs are being promoted for all types of data 
sharing. The ubiquity of XML has removed one of the major obstacles to integrating data 
from widely disparate sources - namely, the heterogeneity of data formats. However, 
general-purpose integration of data across the wide are a also re ... 

Keywords: Data integration, Data streams, Query processing, Web and databases, XML 
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Computing graphical queries oyer X.ML data 
Sara Comai, Ernesto Damiani, Piero Fraternali 

October 2001 ACM Transactions on Information Systems (TOIS), Volume 19 issue 4 

Full text available: f 9gdK707.80KBS Additional Information: M citation, abstract, minces, citings, jndex 

The rapid evolution of XML from a mere data exchange format to a universal syntax for 
encoding domain-specific information raises the need for new query languages specifically 
conceived to address the characteristics of XML. Such languages should be able not only to 
extract information from XML documents, but also to apply powerful transformation and 
restructuring operators, based on a well-defined semantics. Moreover, XML queries should 
be natural to write and understand, as nontechnical person ... 

Keywords: Document restructuring, graphical query languages, semantics 
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June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
Management of data 
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With the growing importance of XML in data exchange, much research has been done in 
providing flexible query facilities to extract data from structured XML documents. In this 
paper, we propose ViST, a novel index structure for searching XML documents. By 
representing both XML documents and XML queries in structure-encoded sequences, we 
show that querying XML data is equivalent to finding subsequence matches. Unlike index 
methods that disassemble a query into multiple sub-queries, and then joi ... 

XjRQL:.An.XML..gyervjan 
Norbert Fuhr, Kai Gro(3johann 

April 2004 ACM Transactions on Information Systems (TOIS), Volume 22 Issue 2 

Full text available- f j pdff281.91 KB) Additlonal Information: MfiStftelL abStGKt, ££fe£G£8£. £ifiQS& index 

XIRQL ("circle") is an XML query language that incorporates imprecision and vagueness for 
both structural and content-oriented query conditions. The corresponding uncertainty is 
handled by a consistent probabilistic model. The core features of XIRQL are (1) document 
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ranking based on index term weighting, (2) specificity-oriented search for retrieving the 
most relevant parts of documents, (3) datatypes with vague predicates for dealing with 
specific types of content and (4) structural vagueness f ... 

Keywords: Path algebra, XML, XQuery, probabilistic retrieval, ranked retrieval, vague 
predicates 
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April 1971 Proceedings of the 1971 international ACM SIGIR conference on 
Information storage and retrieval 
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An approach to determining an appropriate file structure for a given application is presented, 
by outlining a methodology for comparing some important aspects of data management 
system performance. The aspect chosen for analysis is the processing time required to 
evaluate Boolean functions defined on data values contained within a file structure and 
select elements from the structure satisfying the expression. Two file structures are studied. 
The structures are each combinations of hierarchical a ... 

Keywords: Boolean function evaluation, data management system, data structure, file 
structure, file structure determination, hierarchical file, hierarchy, inverted file, logical entry, 
modeling, processing algorithm, retrieval, simulation, storage structure, update 
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November 2003 Proceedings of the 2003 ACM symposium on Document engineering 

Full text available: ^.pdf(275 KB). Additional Information: fyjj. citatjon, abstract, references, index terras 

Product data exchange is the precondition of business interoperation between Web-based 
firms. However, millions of small and medium sized enterprises (SMEs) encode their Web 
product data in ad hoc formats for electronic product catalogues. This prevents product data 
exchange between business partners for business interoperation. To solve this problem, this 
paper has proposed a novel concept-centric catalogue engineering approach for 
representing, transforming and comparing semantic contexts in a ... 

Keywords: XML product map, XPM, ad hoc product data exchange, concept, context 
comparison, context representation, context transformation, electronic commerce, electronic 
product catalogue, product data integration, semantics 



10 External meffi^ 
Jeffrey Scott Vitter 

June 2001 ACM Computing Surveys (CSUR), volume 33 issue 2 

p_ ... , .. lf;ao . ^ ,, C( , Additional Information: full citation, abstr?-^ 

Full text available: Wi pafi ; 32b. 46 KB • J ' 

^ * terms 

Data sets in large applications are often too massive to fit completely inside the computers 
internal memory. The resulting input/output communication (or I/O) between fast internal 
memory and slower external memory (such as disks) can be a major performance 
bottleneck. In this article we survey the state of the art in the design and analysis of 
external memory (or EM) algorithms and data structures, where the goal is to exploit locality 
in order to reduce the I/O costs. We consider a varie ... 
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Keywords: B-tree, I/O, batched, block, disk, dynamic, extendible hashing, external 
memory, hierarchical memory, multidimensional access methods, multilevel memory, 
online, out-of-core, secondary storage, sorting 
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We present techniques for optimizing queries in memory-resident database systems. 
Optimization techniques in memory-resident database systems differ significantly from those 
in conventional disk-resident database systems. In this paper we address the following 
aspects of query optimization in such systems and present specific solutions for them: (1) a 
new approach to developing a CPU-intensive cost model; (2) new optimization strategies for 
main-memory query processing; (3) new insight into ... 
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Abstract. Due to their expressive power, regular expressions (REs) are quickly becoming an 
integral part of language specifications for several important application scenarios. Many of 
these applications have to manage huge databases of RE specifications and need to provide 
an effective matching mechanism that, given an input string, quickly identifies the REs in 
the database that match it. In this paper, we propose the RE-tree, a novel index structure 
for large databases of RE specifications. Gi ... 

Keywords: Index structure, Regular expressions, Sampling-based approximations, Size 
measures 
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We introduce a new data structuring paradigm in which operations can be performed on a 
data structure not only in the present but also in the past. In this new paradigm, called 
retroactive data structures, the historical sequence of operations performed on the data 
structure is not fixed. The data structure allows arbitrary insertion and deletion of operations 
at arbitrary times, subject only to consistency requirements. We initiate the study of 
retroactive data structures by formally def ... 
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We consider the problem of answering a series of on-line queries on a static database. The 
conventional approach to such problems involves a preprocessing phase which constructs a 
data structure with good search behavior. The data structure is then used to process a 
series of queries without any further reordering. Our approach involves dynamic or query- 
driven structuring of the database, i.e. we process the database only when it is required for 
answering a query. We present optimal algorit ... 
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A major challenge still facing the designers and implementors of database programming 
languages (DBPLs) is that of query optimisation. We investigate algebraic query optimisation 
techniques for DBPLs in the context of a purely declarative functional language that supports 
sets as first-class objects. Since the language is computationally complete issues such as 
non-termination of expressions and construction of infinite data structures can be 
investigated, whilst its declarative nature allows th ... 

Keywords: Algebraic manipulation, Database management, Database programming 
languages, Functional languages, Query optimisation 
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Semantics determines the meaning of language constructs; hence it says much more than 
syntax does about implementing the language. The main purpose of this paper is a formal 
presentation of the meaning of basic language constructs employed in many database 
languages (sublanguages). Therefore, stylized query languages SSL (Sample Selection 
Language) and J (Joins) are introduced, wherein most of the typical entries present in other 
query languages are collected. The semantics of SSL and J are ... 
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